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Proof assistants and programming languages based on type theories usually come in two flavours: 
one is based on the standard natural deduction presentation of type theory and involves eliminators, 
while the other provides a syntax in equational style. We show here that the equational approach 
corresponds to the use of a focused presentation of a type theory expressed as a sequent calculus. A 
typed functional language is presented, based on a sequent calculus, that we relate to the syntax and 
internal language of Agda. In particular, we discuss the use of patterns and case splittings, as well as 
rules implementing inductive reasoning and dependent products and sums. 


1 Programming with Equations 


Functional programming has proved extremely useful in making the task of writing correct software 
more abstract and thus less tied to the specific, and complex, architecture of modern computers. This, 
is in a large part, due to its extensive use of types as an abstraction mechanism, specifying in a crisp 
way the intended behaviour of a program, but it also relies on its declarative style, as a mathematical 
approach to functions and data structures. However, the vast gain in expressivity obtained through the 
development of dependent types makes the programming task more challenging, as it amounts to the 
question of proving complex theorems — as illustrated by the double nature of proof assistants such as 
Coq ifTll and Agda IITSII . Keeping this task as simple as possible is then of the highest importance, and it 
requires the use of a clear declarative style. 

There are two main avenues for specifying a language of proofs, or programs, that is abstract enough 
to support complex developments involving dependent types. The first approach, chosen by the Coq 
project, is to have a language of tactics that partially automate the construction of proofs — that is, to 
mechanically construct complex programs based on the composition of a few generic commands. While 
this takes the development task closer to the usual idea of proving a mathematical theorem, the second 
approach is to take the programming viewpoint: although Coq allows to directly write proof terms, this is 
better illustrated by Agda, where a syntax inspired by Haskell |!T(| provides a clear equational style. 

Our goal here is to investigate the relations between the equational style of dependently-typed 
functional programming as found in Agda to the proof-theoretical description of intuitionistic logic 
given in the sequent calculus. In particular, we claim that n focused sequent calculus, akin to the LJF 
system of Liang and Miller |[T5l . offers a logical foundation of choice for the development of a practical 
dependently-typed language. We intend to support this claim by showing how the equational syntax of 
Agda and the internal structure of its implementation correspond to a computational interpretation of 
such a calculus — for an extended for of intuitionistic logic including dependencies and (co)induction. 
As it turns out, the use of left rules rather than eliminations for positive connectives such as disjunction, 
in sequent calculus, yields a simpler syntax. In general, beyond the use of spines in applications, as 
in LJT d and quite common in the implementation of functional programming languages or proof 
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assistants, the structure of the sequent calculus is much closer to the equational style of programming than 
natural deduction, the standard formalism in which type theory is usually expressed ifT^ . Using a focused 
system rather than a plain sequent calculus based on LJ provides a stronger structure, and emphasizes the 
importance of polarities, already observed in type theory O. 

Beyond the definition of a logical foundation for a functional language in equational style, giving a 
proof-theoretical explanation for the way Agda is implemented requires to accomodate in the sequent 
calculus both dependent types and a notion of inductive definition. This is not an easy task, although there 
has been some work on dependent types in the sequent calculus ifT^ and there is a number of approaches 
to inductive definitions in proof theory, including focused systems Q . For example, the system found 
in l[T4i is based on LJT but is limited to IT and does not support Z, while lIT^ has both, but requires an 
intricate mixture of natural deduction and sequent calculus to handle £. Induction is even more complex to 
handle, since there are several approaches, including definitions |[T9l or direct least and greatest fixpoints 
as found in /iMALL Q and /rLJ m. From the viewpoint of proof-theory, the least fixpoint operator jj. 
seems to be well-suited, as it embodies the essence of induction, while the greatest fixpoint v allows to 
represent coinduction. However, these operators are not used the same way as inductive definitions found 
in Agda or other languages or proof assistants — they seem more primitive, but the encoding of usual 
constructs in terms of fixpoints is not obvious. Even more complicated is the question of using fixpoints 
in the presence of dependent types, and this has only been studied from the type-theoretic viewpoint 
in complex systems such as the Calculus of Inductive Constructions lITOl . In the end, what we would 
like to obtain is a proof-theoretical understanding of the equational style of dependent and (co)inductive 
programming, related to the goals of the Epigram project. In particular, we consider that the sequent 
calculus, with its use of left rules, provides access to the “left” of equations in a sense similar to what is 
described in ifTTll . 

Here, we will describe the foundamental ideas for using a variant of LJF as the basis for the design of 
a dependently-typed programming language. We start in Sectionj^by considering a propositional system 
and show how the shape of sequent calculus rules allows to type terms in equational style. This is made 
even more obvious by the use of pattern in the binding structure of the calculus. Then, in Sectionj^we 
discuss the extension of this system to support dependent types and induction, problems related to patterns 
in this setting, as well as the question of which proof-theoretical approach to induction and coinduction is 
better suited for use in a such a language. Einally, we conclude by the review of some research problems 
opened by this investigation, and an evaluation of the possible practical applications to languages and 
proofs assistants. 

2 Focusing and Polarities in the Sequent Calculus 

We start our investigation with a propositional intuitionistic system presented as a focused sequent calculus. 
It is a variant of LJF ifTSl to which we assign a term language extending the A-calculus of Herbelin ifTSll . 
Unlike the calculus based on LJT, this system has positive disjunctions and conjunctions V and x, but it 
has no positive atoms. We use the following grammar of formulas: 

N,M ::= a | | P^N \ NAM P,Q ::= |A | PVQ \ PxQ 

where t and j, are called polarity shifts and are meant to maintain an explicit distinction between the 
two categories of formulas, negatives and positives. This is not absolutely necessary, but it clarifies the 
definition of a focused system by linking the, focus and blur rules to actual connectives. Note that this was 
also used in the presentation of a computational interpretation of the full LJF system [^. 
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'F k r/ : [P] 

'F 1 • Ft :N 

'¥,[N]\=e:N 

'FI ■h>d:-lP 

'F k<? : [IN] 

^,x: IN, [N]\=k:M 

^,x:iN \r\-t :M 

'F 1 p:PFt:N 

'FjX : IN ■ \- X k : M 

'F 1 r,x : AN k t : M 

'F, [tP] k Kp.t : N 

\r,p:PPt:N 

'F, [N] k k : L 

'i>,[M]\=k:L 

'F 1 TP lp.t:P-^N 

'F,[NAM] kprl k:L 'F, [N A M] k prr k : L 

'F 1= d : [P] '¥,[N]\=k:M 

'F 1 F Ft :N 

'F 1 F Fm :M 

^>,[P-^N]\=d::k:M 

'F 1 F h {t,u) :NAM 

F k r/ : [P] F k c : [2] 

r\=d:[P] 

F k J : [2] 

rk(r/,c): [Px Q] 

r k ini J : [P V 2] 

F k inr J : [P V 2] 

^ \r,p:P,q:Q\-t:N 

'F 1 r,p :P Ft :N 

'F r,q : Q u : N 

'F 1 r,ip,q):PxQht:N 

'F T,x[p \ q]\ P y Q\- x[t \u]\ N 


Figure 1: Typing rules for a pattern-based A-calculus based on A 


Tbe rules we use in this system are shown in Figure [TJ where the term assignment is indicated in 
red and several turnstiles are used to distinguish an inversion phase h from a focused phase l=. In this 
syntax, brackets are used to pinpoint the precise formula under focus. The extended A-calculus we use to 
represent proofs is based on the following grammar: 


t,u 

\= \> d 

Xp.t 

xk {t,u) 1 X 

p,q 

:= X 

{?,<]) 

x[p 1 q] 

d,e 

:= <it 

{d,e) 

ini d 1 inr d 

k,m 

:= £ 

t :: k 

prl k 1 prr k 


where t denotes a term, p a binding pattern, d a data structure and k an application context. In terms of 
programming, terms are describing computation, mostly by means of functions, while data structures 
implement pairs and constructors. Note that computations can use case splittings | n] to choose between 
the subterms t or u depending on the contents of the data bound to x. The use of patterns rather than plain 
variables to annotate formulas in the context of typing judgement is taken from pSl] and allows to express 
more directly the equational style found in Agda. For example, we could write: 

f : (N X N) tt) N N 
f (ini (x,y)) = X + y 
f (inr z) = z 

to dehne a function / that uses pattern-matching on its argument and computes the result based on the 
components of the data structure it received. Such a function can be written in our calculus as the following 
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term: Xw[{x,y) \ z].w[add ((;c e) :: {y e) :: £) | z £], where add is the name of the addition function. This 
makes the compilation of the code written above to the adequate representation in our calculus relatively 
easy, since different parts of a definition can he aggregated into a term with a pattern and a case splitting. 
This is very much related to the question of compiling pattern-matching into a specific splitting tree where 
case constructs are used Q. 

The idea of the logical approach is that cut elimination in this system yields a reduction system 
implementing the dynamics of computation in the corresponding calculus. In such a focused calculus, a 
number of cut rules are needed to complete the proof of completeness of the cut-free fragment, but only 
two of them really need to be considered as rules — the other cuts can simply be stated as principles, and 
their reduction will correspond to a big step of computation. These two rules are: 

'TNd:[P] ^\T,p:Pht:N 'T|rht:A^ ^,[N]\=k-.M 

^ \ TPp = dint:N ^ \ TPtk:M 

the first one being the binding of a data structure to a matching pattern, and the second a simple application 
of a term to a list of arguments. The latter is already part of the LJT system ifTSl . but the former is specific 
to LJF in the sense that it appears only when formulas can be focused on the right of a sequent. The main 
reduction rule extracted from cut elimination is the A variant of j3-reduction: 

{Xp.t){d::k) —)> {p = dint)k 

but there are a number of other reduction rules generated by the use of other connectives than implication. 
In particular, conjunction yields a form of pairing where a term {t,u) has to be applied to a list prl k 
to reduction to t k. The binding cut is simpler in a certain sense, since its reduction corresponds to a 
decomposition of the data structure d according to the shape of the pattern p, and a simple substitution 
when p is just a variable. Moreover, other cuts encountered during reduction usually amount to a form of 
substitution, except for the one, already present in LJT, that yields lists concatenation in the argument of 
an application. 

Note that the > d construct is present in the internal language of Agda, but the constructs <i t and Kp.t 
are not, although they can be obtained indirectly using a cut. While <i t should simply be understood as 
a thunk, which is a term made into data, the list Kp.t is slightly more complex. This construct, already 
present in 0, is more a context than a list in the sense that it stops the application of a term to Kp.t and 
enforces the execution of t, where the original term applied is bound to p. This can be understood by 
considering the reduction extracted from cut elimination: 

{>d){Kp.t) —)■ p = dirLt 

Finally, note that we could have an explicit contraction rule in the system, that would appear in terms 
under the form of a pattern p® q indicating that p and q will be the patterns associated to two copies of the 
same assumption P. The associated typing rule is: 

\ T,p-.P,q\PPf.N 
'F I T,p®q -. P \- t '.N 

and it is reminiscent of the pattern using the same syntax in Haskell — which is meant to exist in Agda as 
well, but this not yet implemented. However, in Haskell, this is restricted to the form x @ p so that it can 
only serve to name an assumption before decomposing it, and we could allow for such a use by avoiding 
maximal inversion, which is not strictly necessary in a focused system fT]. This rule is not necessary for 
the completeness of the calculus, and there are other ways to obtain the same result. Of course, in a very 
similar way, the pattern _ can be associated to the weakening rule, also admissible. 
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3 Adding Dependent Types and Induction 

We continue our investigation by adapting our variant of LJF to dependent types, but this unveils 
some issues that we will now discuss. On problem we immediately encounter is the adaptation of the 
pattern machinery to the dependent setting, mostly due to the substitutions involved in the types, where 
patterns should have appeared. For the dependent implication n(v : P).N, using a pattern p rather than 
a binding variable v yields the question of substituting a data structure d for p\ this becomes a much 
more complicated operation than the traditional substitution. Moreover, keeping the patterns and variables 
synchronised between their use in terms and in types is a challenging task, that would probably require 
heavy syntactic mechanisms. For this reason, the system shown above in Figure |^has no patterns, but 
rather falls back to the traditional style of typing using only variables to label assumptions. The language 
used in this variant can still be related to the equational approach to functional programming, but the 
translation between equations and terms is more involved. 

The generalisation of the implication into the dependent product n(;c: P).N is a straightforward 
operation, and the rules we use are essentially the ones found in llT4l — except that it involves a data 
structure, corresponding to a focus on the right-hand side of a sequent. Now, the case of £ is more 
complicated, as it is a priori unclear whether it should be obtained as a generalisation of the negative 
conjunction A or of the positive product x and both solutions might even be possible. But a generalisation 
of the negative disjunction seems to be problematic, when it comes to the specification of the second left 
mle, typing the prr operation. Indeed, when focusing on L{x : N).M we would need to plug a term of type 
N for V in M, but this would require to maintain some “natural deduction version ” of the term currently 
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being transformed, and to plug at adequate locations some translation between natural deduction style and 
our sequent calculus syntax — as done in ifT^ . This is quite unsatisfactory and will not help us build a 
proper understanding of dependent types in a pure sequent calculus setting. The solution we adopt here is 
to obtain L{x : P).Q as a generalisation of the positive product x and simply update the corresponding 
rules as shown in Figure The left rule is simple to define in this case, because the decomposition of the 
£ in the context preserves the binding of y in the type Q. 

There is a particularly interesting benefit to the use of the sequent calculus to handle splitting as done 
in the left £ rule. Consider the elimination rule in natural deduction: 

r,x : A VB h C : type ri-f:AVB r,y : A h n : C{inly/x} r,z : B h v : Cjinr z/x} 

Vc- 

r h match [x.C] {t ; y.w, z-v) : C{l/x} 

and observe that it is necessary to be explicit about the return type, since obtaining C from C{t/x] is a 
complicated process, that reverses a substitution. This makes the term syntax heavy, while the problem is 
avoided in the sequent calculus, where no substitution is needed in the conclusion. Note that in Coq, the 
natural deduction style is used for the proof language, but tactics are written in a style that is much closer 
to the sequent calculus — as this is the framework of choice for proof search — so that tactics have to 
perform some kind of translation between the two formalisms. 

At the level of dependent types, there is a number of tricks used in the Agda implementation that 
diverge from the proof-theoretical viewpoint. For example, substitutions in types are treated in a complex 
way and may be grouped together. Although some of the design choices can be justified by a similarity 
to the focused sequent calculus, there is probably a number of implementation techniques that have no 
proof-theoretical foundation. Moreover, we have chosen here a particularly precise framework where 
formulas are explicitly polarised, but in practice types in a programming language should not always 
require these annotations: the question of the presence of specific terms corresponding to shifts is therefore 
not obvious, as it depends if some interesting programming constructs require their presence or their 
absence. One can observe, for example, that in the system proposed here, dependencies are subject to the 
presence of delays, because of the contraction present in the left focus rule and of the treatment of names 
in the Kx.t operation. 

The problem of generalising the equational style of programming associated to the focused sequent 
calculus at the propositional level to the level of dependent types is parametrised by a choice: using patterns 
seems to require a complex tracking mechanism, but provides a relatively direct logical representation 
of equations, while using simple variables leads to a translation overhead. Notice however that one 
could think of an implementation based on variables in which equations are easily obtained, since the 
language would aheady be expressed in the style of the sequent calculus — this is the approach suggested 
by Epigram, where equations are meant to clarify the meaning of programs but are not their internal 
representation. But we now turn to the most challenging task of our whole enterprise: the accomodation 
of induction in the framework of a focused sequent calculus in a form that can be exploited to design 
declarative programs. 

Induction can be expressed in Agda in a concise manner and enjoys the benehts of the equational 
presentation. Consider for example the following inductive scheme for natural numbers: 

indpj : P zero —)■ (n(x:N).Px —P (sue x)) —n(n:N).Pn 

indpj base ih zero = base 

indpj base ih (sue n) = ih n (indpj base ih n) 

where the code essentially relies on the matching of a natural number, that can be either zero or the 
successor of another number. It is not obvious to see through this program and select a particular approach 
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to induction that would be a good candidate for a proof-theoretical description. The natural candidate for a 
representation of induction in the sequent calculus would be the jj. operator as studied in [4| in the setting 
of intuitionistic logic. The unfocused rules for this operator would be, from a purely logical viewpoint: 

rhB{na.B/a} B{C/a}\-C 

T jra.B r,/ia.BI-C 

but the presence of fixpoints has consequences for cut elimination, as it prevents some cuts to be reduced. 
The usual technique applied to avoid this problem is to build the cut rule into the left rule for /i and to 
consider the result as cut free. This way, all the cuts that cannot be reduced further are explicitly attached 
to the blocking rule instance. However, the use of these rules in terms of computation is not obvious 
to specify, in part because of the complexity of the associated cut reduction, that involves the creation 
of several other cuts and appeals to the functoriality of the body B of any hxpoint jXa.B — ensured by 
a positivity condition. In addition, these rules seem to interact poorly with dependent types, as dealing 
with fixpoints will require a complex handling of terms appearing inside types. It is unclear as of now 
if fixpoints as expressed by /r — and v in the case of induction — can fit our scheme of explaining the 
implementation of a language such as Agda, but at the same time there is no obvious pro of-theoretical 
approach that accounts in a straightforward way for the pervasive nature of inductive definitions in the 
internal language of Agda, where they are handled by expansion of names with the body of the definition. 

4 Conclusion and Future Work 

As we have seen here, the A-calculus proposed by Herbelin as an interpretation of the LJT focused 
sequent calculus can be extended beyond its original scope to include positive connectives, leading to a 
full-fledged intuitionistic system where we can focus on the right-hand side of sequents to decompose 
positives. The language we obtain is well-suited to represent programs written in the kind of equational 
style found in Haskell or Agda, the relation to equations can be made even tighter by using patterns 
as labels for assumptions in the type system. The opens up the possibility to select focused sequent 
calculus as a logical framework of choice for the implementation of such languages — as evidenced by the 
current state of the implementation of Agda, containing many elements that can be explained as sequent 
calculus constructs. The benefit could not only be a simplication of such an implication, but possibly an 
improvement in terms of efficiency if advanced techniques from proof theory are transferred and made 
practical. Moreover, one of the strength of the logical approach is that generalisations and extensions of 
all kinds are usually made simpler by the strong principles at work: any kind of progress made on the 
side of proof theory could translate into more expressive languages using the clear equational style of 
Haskell and Agda — that could be modalities, linearity or many other elements studied in the field of 
computational logic. 

The generalisation of this idea to handle dependent types has already been partially investigated, but 
some question are left unresolved as to the specific rules used in such a system, and the possibility of 
making the system more equational by exploiting patterns. But the most difficult task at hand is the 
explanation of the various treatments of induction available in language and proofs assistants in terms 
of the sequent calculus. As observed previously l|2l, the notion of polarity seems to be important in the 
understanding of this question, but unfortunately the proper polarised handling of fixpoints in proof theory 
has yet to be found — a number of choices are left open when it comes to the definition of a focused 
system using fixpoints [O. Note that our enterprise also yields the question of the treatment of the identity 
type in proof theory, as it makes dependent pattern matching admit the axiom K. This axiom is undesirable 
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in homotopy type theory, and thus the restriction of dependent pattern matching has been studied |f9i But 
this was achieved by restricting unification in the splitting rules, and as Agda has no explicit calculus for 
splitting, this was somewhat hidden. The framework we want to develop provides a calculus and could 
thus help making this restriction simpler. 
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